Intentional voice command detection for completely hands-free speech interface in home environments
نویسندگان
چکیده
We introduce a new class of speech processing, called Intentional Voice Command Detection (IVCD). It is necessary to reject not only noises but also unintended voices to achieve completely hands-free speech interface. Conventional VAD framework is not sufficient for such purpose, and we discuss how we should define IVCD and how we can realize it. We investigate implementation of IVCD from the viewpoint of feature extraction and classification, and show that the combination of various features and SVM can achieve IVCD accuracy of 93.2% for a large-scale audio database in real home environments.
منابع مشابه
System Request Utterance Detection Based on Acoustic and Linguistic Features
Robots are now being designed to become a part of the lives of ordinary people in social and home environments, such as a service robot at the office, or a robot serving people at a party (H. G. Okuno, et al., 2002 ) (J. Miura, et al., 2003). One of the key issues for practical use is the development of technologies that allow for user-friendly interfaces. This is because many robots that will ...
متن کاملA Real-Time Speech Command Detector for a Smart Control Room
In this work we present an always-on speech recognition system that discriminates spoken commands directed to the system from other spoken input. For discrimination we integrated various features ranging from prosodic cues and decoding features to linguistic information. The resulting ”Speech Command Detector” provides intuitive hands-free user interaction in a Smart Control Room environment wh...
متن کاملTue-SeA Real-Time Speech Command Detector for a Smart Control Room
In this work we present an always-on speech recognition system that discriminates spoken commands directed to the system from other spoken input. For discrimination we integrated various features ranging from prosodic cues and decoding features to linguistic information. The resulting ”Speech Command Detector” provides intuitive hands-free user interaction in a Smart Control Room environment wh...
متن کاملVoice and Noise Detection with AdaBoost
Speech recognition is one of our most effective communication tools when it comes to a hands-free (human-machine) interface. Most current speech recognition systems are capable of achieving good performance in clean acoustic environments. However, these systems require the user to turn the microphone on/off to capture voices only. Also, in hands-free environments, degradation in speech recognit...
متن کاملHands-free human-machine dialogue - corpora, technology and evaluation
In this paper we will review the progress of hands-free, Voice User Interface (VUI) research work at Bell Labs, including: a multichannel data base collection, technology development, and performance evaluation. Thirty-channel, simultaneous recordings have been conducted in a moving car, collecting speech from 57 subjects under various weather, road, and noise conditions. These are being used f...
متن کامل